Adaptive ML-weighting in multi-band recombination of Gaussian mixture ASR
Abstract
Multi-band speech recognition is powerful in band-limited noise, when the recognizer for the noisy, less reliable band can be given less weight in the recombination process. Deciding accurately which bands are reliable and which are corrupted by noise is usually hard. In this article, we investigate a maximum-likelihood (ML) approach to adapting the combination weights of a multi-band system. The Gaussian mixture model parameters are kept constant, while the combination weights are iteratively updated to maximize the data likelihood. Unsupervised offline and online weight adaptation are compared with equal weights, with 'cheating' weights where the noisy band is known, and with the fullband system. Initial tests indicate that both ML-weighting strategies yield a robustness gain in band-limited noise.
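The weight update described in the abstract can be illustrated with a small sketch. The abstract does not give the recombination formula, so the code below assumes a linear combination of per-band likelihoods, p(x_t) = sum_b w_b * p_b(x_t), and applies the standard EM update for mixture weights with the component models held fixed. The function name `adapt_band_weights` and the input layout are hypothetical, chosen only for illustration; this is not the authors' implementation.

```python
def adapt_band_weights(band_likelihoods, n_iter=50):
    """Offline ML re-estimation of band combination weights (illustrative sketch).

    band_likelihoods: list of frames; each frame is a list of per-band
    likelihoods p_b(x_t) > 0 produced by fixed (non-adapted) band models.
    Assumes a linear recombination p(x_t) = sum_b w_b * p_b(x_t).
    """
    n_frames = len(band_likelihoods)
    n_bands = len(band_likelihoods[0])
    weights = [1.0 / n_bands] * n_bands  # start from equal weights

    for _ in range(n_iter):
        totals = [0.0] * n_bands
        for frame in band_likelihoods:
            # Combined likelihood of this frame under the current weights.
            mix = sum(w * p for w, p in zip(weights, frame))
            # Posterior share of each band for this frame (E-step).
            for b, (w, p) in enumerate(zip(weights, frame)):
                totals[b] += w * p / mix
        # New weight = average posterior share over all frames (M-step).
        weights = [t / n_frames for t in totals]
    return weights


# Hypothetical data: the third band is corrupted and yields low likelihoods.
frames = [[0.9, 0.8, 0.01] for _ in range(20)]
print(adapt_band_weights(frames))
```

Each iteration of this update cannot decrease the data likelihood under the assumed linear model, and the weights stay non-negative and sum to one. With a fixed set of bands it tends to concentrate the weight on the best-scoring band, so a practical system would probably limit the number of iterations or smooth the estimate. An online variant would update the weights frame by frame instead of over the whole utterance.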
Related resources
Spectral Entropy Feature in Multi-Stream for Robust ASR
In recent papers, entropy computed from sub-bands of the spectrum was used as a feature for automatic speech recognition. In the present paper, we further study the sub-band spectral entropy features which can give the flatness/peakiness of the sub-band spectrum and in turn the position of the formants in the spectrum. The sub-band spectral entropy features are used in hybrid hidden Markov mode...
MAP combination of multi-stream HMM or HMM/ANN experts
Automatic speech recognition (ASR) performance falls dramatically with the level of mismatch between training and test data. The human ability to recognise speech when a large proportion of frequencies are dominated by noise has inspired the “missing data” and “multi-band” approaches to noise robust ASR. “Missing data” ASR identifies low SNR spectral data in each data frame and then ignores it....
Multi-band speech recognition in noisy environments
This paper presents a new approach for multi-band based automatic speech recognition (ASR). Recent work by Bourlard and Hermansky suggests that multi-band ASR gives more accurate recognition, especially in noisy acoustic environments, by combining the likelihoods of different frequency bands. Here we evaluate this likelihood recombination (LC) approach to multi-band ASR, and propose an alternati...
Asynchrony with trained transition probabilities improves performance in multi-band speech recognition
One of the central themes in multi-band automatic speech recognition (ASR) is to devise a strategy for recombining sub-band information. This in turn raises two questions: (1) at what phonetic unit should the recombination take place? (2) How asynchronously should the sub-bands be run? Theoretically asynchronous multi-band ASR should perform at least as well as synchronous multi-band ASR. Howev...
Noise Robust Speaker Identification Using Sub-Band Weighting in Multi-Band Approach
Recently, many techniques have been proposed to improve speaker identification in noisy environments. Among these techniques, we consider the feature recombination technique for the multi-band approach in noise robust speaker identification. The conventional feature recombination technique is very effective in the band-limited noise condition, but in the broad-band noise condition, the conventional...